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Recently developed genomics-based tools are allowing repositioning of Food and Drug Administration 
(FDA) -approved drugs as cancer treatments, which were employed to identify drugs that target cancer stem 
cells (CSCs) of breast cancer. Gene expression datasets of CSCs from six studies were subjected to 
connectivity map to identify drugs that may ameliorate gene expression patterns unique to CSCs. All-trans 
retinoic acid (ATRA) was negatively connected with gene expression in CSCs. ATRA reduced 
mammosphere-forming ability of a subset of breast cancer cells, which correlated with induction of 
apoptosis, reduced expression of SOX2 but elevated expression of its antagonist CDX2. SOX2/CDX2 ratio 
had prognostic relevance in CSC-enriched breast cancers. K-ras mutant breast cancer cell line enriched for 
CSCs was resistant to ATRA, which was reversed by MAP kinase inhibitors. Thus, ATRA alone or in 
combination can be tested for efficacy using SOX2, CDX2, and K-ras mutation/MAPK activation status as 
biomarkers of response. 

Cancer cell subpopulations with stem/progenitor cell-like properties have been described for several solid 
tumors 1,2 . These cancer cells termed cancer stem cells (CSCs) are isolated based on differential cell surface 
marker expression and then characterized for self-renewal and differentiation properties through in vitro 
sphere assays (mammospheres) and/or tumorigenicity in non-obese diabetic/severe combined immunodefi- 
ciency (NOD/SCID) mice 1 . At least two types of breast cancer cells display CSC properties: 1) CD44 + /CD24~/ 
Lineage" cells, the first described CSCs, found mostly in basal-type breast cancers 3 ; 2) Cancer cells that express 
higher levels of Aldehyde Dehydrogenase 1 (ALDH1 + ), which are present mostly in luminal breast cancers 4 . 
Additional markers that further refine CSCs including Delta-like (DLL), Delta/Notch-like EGF repeat containing 
(DNER), CD271, ganglioside GD2, and Dopamine receptors 3 and 5 have been reported 5 8 . 

Although definition of CSCs remains largely operational, CSCs might explain tumor heterogeneity, chemo- 
therapy/radiation resistance, and metastasis 1 . Endocrine- and chemotherapy- resistant luminal-type breast can- 
cers acquire CSC properties with concomitant loss of luminal features and gain of basal-like features 910 . 
Neoadjuvant trials with docetaxel or letrozole (endocrine therapy) have shown enrichment of CSCs in residual 
luminal tumors 11 . Elevated levels of CSCs in primary tumors correlates with higher tumor grade, brain and lung 
relapse, and poor outcome 12 . A meta-dataset analysis involving seven independent breast cancer gene expression 
datasets has identified enrichment of four gene expression signatures including normal mammary stem cells and 
embryonic stem cell signatures in higher-grade tumors with CSC phenotype 12 . 

Breast cancers are subclassified into five intrinsic subtypes 13 . Among these subtypes, claudin-low subtype is 
enriched for CSCs 14 . Claudin-low subtype breast cancers are triple negative breast cancers (TNBCs), which lack 
the expression of estrogen receptor (ER), progesterone receptor (PR), and HER2. Recent studies have further 
refined TNBCs into six subtypes based on gene expression patterns: basal-like 1 (BL-1), basal-like 2 (BL-2), 
mesenchymal (ML), mesenchymal- stem like (MSL), immunomodulatory (IM), and luminal androgen receptor 
(LAR) 15 . The gene expression pattern in MSL and ML subtypes overlaps with the gene expression pattern in CSCs 
and claudin-low subtype. Thus, three subtypes of breast cancers (claudin-low, MSL, and ML), high-grade breast 
cancers (G3), and tumors that are resistant to currently available therapies may require drugs that target CSCs. 
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Progress in developing drugs targeting CSCs has been slow. 
Salinomycin was recently suggested to preferentially target 
CD44 + /CD24± CSCs in in vitro studies 16 . However, it is less likely 
to enter the clinic because it is equally toxic to normal stem cells in 
vivo 8 . IL-8/CXCR1/CXCR2 pathway is being considered to target 
CSCs 17 . However, for immediate need, repurposing of existing 
FDA approved drugs with additional considerations for biomarkers 
of drug sensitivity is the best option, which was investigated in this 
study. 

Results 

Connectivity map (CMAP) reveals the effect of ATRA in reversing 
CSC-enriched gene expression pattern. With recent advances in 
genomics, we now have tools to revisit reasons for failures of 
previous clinical trials and to identify biomarkers of drug sensiti- 
vity. We approached this issue by combining cancer stem cell geno- 
mics with connectivity map (CMAP) 18 ' 19 . The CMAP is a database of 
gene expression profiles in four cell lines (MCF-7, HL-60, SKMEL5, 
and PC3) under treatment with differing concentrations of —1000 
FDA approved drugs. The database contains —6100 gene expression 



profiles resulting from treatment of cell lines with different 
concentrations of these drugs 18 . The gene expression profiles from 
CMAP can be compared with gene expression profiles in other 
experiments to investigate how much expression in a condition 
correlate with expression resulting from drug treatment. The correla- 
tion is given a score from + 1 (maximum positive correlation) to — 1 
(maximum negative correlation) based on the extent of correlation. 
Drugs that receive the score close to —1 are likely to have a 
therapeutic value since their gene expression profile is a reversal of 
profile present in the experimental condition. This approach has 
resulted in identification of Cimetidine, an antiulcer drug, as a 
potential therapy for lung cancer 19 . We performed CMAP analyses 
of gene expression datasets comparing MCF-10A CD44+/CD24— 
with CD44-/CD24+ subpopulation 20 , tumorigenic (CD44 + / 
CD24— /Lin— ) cells versus non-tumorigenic cells from primary 
tumors 21 , genes up or down-regulated in pooled normal and metas- 
tatic CD44+ breast cancer cells versus normal and metastatic 
CD24+ cells 22,23 , and transformed SSEA1+ CSC fibroblasts versus 
transformed SSEA1— fibroblasts 23 . Genes differentially expressed in 
CD271+ basal-like cells with CSC activity as well as in GD2- 
enriched cells, which overlap with CD44 + /CD24— cells, were 
included 6,7 . Table 1 provides a partial list of highly connected 
drugs. All-trans retinoic acid (ATRA, also called Tretinoin) and 
the PPARy agonist Poiglitazone (ACTOS) emerged as drug 
candidates that are negatively associated with CSC-enriched gene 
expression signatures. Since PPARy agonists have been withdrawn 
from market but Tretinoin and drugs such as bexarotene (Targretin) 
with properties overlapping ATRA are in clinical use 24,25 , we 
evaluated the ability of ATRA in ameliorating CSC properties. 

ATRA is negatively connected with gene expression pattern in ML 
and MSL subtypes of TNBCs. Since MSL and ML subtypes of 
TNBCs are enriched for CD44+/CD24— cells 15 , we subjected gene 
expression signatures of subtypes of TNBCs to CMAP. This analysis 
allowed us to determine whether gene expression pattern in MSL and 
ML but not other subtypes of TNBCs negatively correlates with 
Tretinoin treatment (Table 2). Tretinoin (ATRA) had a score of 
—0.962 and —0.841 for MSL and ML subtypes, respectively. Interes- 
tingly, few drugs demonstrated opposing scores for different 
subtypes of TNBCs. For example, Trichostatin A score was + 1 for 
LAR subtype, whereas it was — 1 for IM subtype. Metformin, which is 
now being evaluated in breast cancer clinical trials, as well as 
paclitaxel, were ineffective against BL-1 and BL-2 subtypes with 
both scoring positive values. These observations, if true in vivo, 
highlight the need for biomarker driven clinical trial, which can 
exclude patients who may perform poorly under a specific 
therapeutic regime. 

ATRA induces differentiation of fulvestrant-resistant breast cancer 
cells with acquired CD44+/CD24+ phenotype. CMAP results for 
breast cancer were derived from MCF-7 cells. These cells are estrogen 
receptor (ERa) -positive, proliferate in response to estradiol treatment, 
but are growth inhibited by anti-estrogens such as tamoxifen and 
fulvestrant. However, these cells, like in patients treated with anti- 
estrogens, eventually acquire resistance to treatment. To validate 
the results of CMAP analysis, we compared parental, 4-hydroxy 
tamoxifen-resistant (OHTR), and fulvestrant (Ful-R)-resistant cells 
for CD44 and CD24 status with or without ATRA treatment for 72 
hours 26 . Note that OHTR and Ful-R cells were derived from a single 
cell MCF-7 clone and thus represent cells that have acquired resis- 
tance to drugs rather than clonal expansion of intrinsically drug resis- 
tant cells 26 . MCF-7 and OHTR cells were predominantiy CD44-/ 
CD24+ with -3 and 12% being CD44+/CD24+ (Figure 1A). 
ATRA treatment reduced the levels of these double-positive cells. 
Ful-R cell line, which expresses very little ERa and displays epithe- 
lial to mesenchymal phenotype (EMT) 26 , contained equal number of 
CD44-/CD24+ and CD44+/CD24+ subpopulation. A recent study 
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Table 2 Connectivity map analysis of TNBC subtypes 
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6099 Paroxetine 


1 ( iM 
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demonstrated enrichment of CD44-/CD24+ and CD44+/CD24 + 
cells in luminal and basal- A cell lines, respectively 27 . The above results 
thus suggest that fulvestrant resistance involves luminal cells acquir- 
ing basal-A phenotype. Additionally, we had previously demon- 
strated that MCF-7 cells that acquired CD44+/CD24+ phenotype 
display CSC phenotype 20 . Ful-R cells treated with ATRA demons- 
trated reversal to CD44— /CD24+ phenotype as percentage of cells 
with CD44+/CD24+ phenotype reduced from 46 ± 3% to 13 ± 
0.5% (p < 0.01 untreated versus ATRA treated). The change in the 
CD44 expression status in Ful-R cells treated with ATRA is not likely 
due to direct effect of ATRA on CD44 transcription as CD44 gene 
lacks RAR binding sites. 

To further ascertain the effect of ATRA in inducing differentiation 
of Ful-R cells, we maintained these cells in ATRA containing media 
for one month and then evaluated cell morphology, and CD44/CD24 
expression status. Long-term ATRA treated cells reestablished cell- 
cell contact, lost some of the morphologic features of EMT including 
fibroblast-like appearance (Figure IB) and CD44+/CD24+ pheno- 
type (Figure 1C) suggesting reversal of EMT phenotype. 

OHTR resistance is associated with elevated CD271 positivity, 
which can be reversed by ATRA. To determine whether ATRA 
has any effect on CSC phenotype characterized using different 
markers, we examined MCF-7, OHTR, and Ful-R cells for CD271, 
which is a cell surface marker of a minority of basal-like cells with 



stem cell activity present in luminal cell lines such as MCF-7 6 . OHTR 
cells contained significantly higher CD271 -positive cells (73 ± 9%) 
compared to MCF-7 (34 ± 10%, p = 0.009, OHTR versus MCF-7) or 
Ful-R cells (40 ± 16%, p = 0.04, OHTR versus Ful-R) (Figure ID). 
Upon ATRA treatment, CD271-positive cells declined in all three- 
cell types, although maximum effects were observed in OHTR cells. 
Thus, ATRA reverses CSC phenotype based on two cell surface 
markers. 

ATRA reduces self-renewal as measured by mammosphere assay. 

We next examined the effect of ATRA on CSC phenotype of MCF-7, 
OHTR, and Ful-R cells in mammosphere assay. Although cell 
aggregates were detected within 2-3 days of plating, the majority 
of these aggregates disappeared after few days, and mammospheres 
appeared after seven days. However, we do acknowledge the 
limitations of this assay, as it is often difficult to distinguish mammo- 
spheres from anoikis-resistant cell aggregates. ATRA reduced the 
size of primary mammospheres formed by MCF-7 and OHTR cells 
(Figure 2A). ATRA reduced the size of Ful-R cell-derived primary 
and tertiary mammospheres (Figure 2A). To minimize cell aggrega- 
tion, we repeated mammosphere assay in media containing 1% 
methylcellulose, which increased media viscosity. ATRA reduced 
the size of mammospheres formed under this growth condition 
(Figure 2B). It is believed that the size of neurosphere reflects self- 
renewal rate 28,29 . Thus, the observed effect of ATRA on the size of the 
mammospheres may indicate its ability to slow the self-renewal 
process. Interestingly, ATRA had a modest growth stimulatory 
effect on Ful-R cells under 2D growth conditions (Figure 2C). 
Thus, appropriate growth conditions are required to assess the 
effects of ATRA and ATRA-resistant phenotype evident in 2D 
culture may not be manifested under 3D or stem cell culture 
conditions. 

Quantitating mammospheres is often difficult. We devised a new 
strategy for visualizing mammospheres by filtering through 40- 
micron filter to remove single cell and small aggregates. The mam- 
mospheres on the top of the filter were fixed and stained with 
Wright-Giemsa. ATRA treated Ful-R cells showed significantly 
lower number of secondary and tertiary mammospheres compared 
to control cells (Figure 2D and data not shown). 

We next performed cell cycle analysis of Ful-R cells grown under 
mammosphere condition with or without ATRA treatment. Untreated 
and ATRA-treated mammospheres showed typical dividing cell cycle 
pattern (Figure 2E). However, ATRA-treated mammospheres had 
higher percentage of debris indicating elevated cell death. We mea- 
sured apoptosis in mammospheres under untreated and ATRA treated 
condition using Annexin V and flow cytometry. Mammospheres were 
trypsinized and single cells were stained for Annexin V and propidium 
iodide. ATRA treated mammospheres demonstrated ~3-fold incr- 
ease in apoptosis/necroptosis compared to untreated mammospheres 
(Figure 2F). These results indicate a pro-apoptotic function of ATRA, 
which is not usually observed under 2D growth conditions. 

Cell type specificity in ATRA action on MSL subtype of TNBCs. To 

further demonstrate an effect of ATRA on CSC phenotype, we 
subjected MDA-MB-231 and MDA-MB-436 cells, both representing 
MSL subtype of TNBCs 15 , to mammosphere assay with or without 
ATRA treatment. The size of mammosphere was significantly 
reduced in MDA-MB-436 cells treated with ATRA compared to 
untreated controls (Figure 2G). Although ATRA reduced the size of 
mammospheres in MDA-MB-231 and TMD-231 cells, the effect was 
modest in both primary and tertiary mammospheres (Figure 2G). 
ATRA-treated MDA-MB-436 cells formed considerably lower num- 
ber of secondary and tertiary mammospheres, although it had no effect 
on proliferation of cells under 2D culture (Figure 2H, I). These results 
suggest that ATRA has cell type specific effects in reversing CSC 
properties based on mammosphere assay. 
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Figure 1 | ATRA reverses CD44+/CD24+ phenotype of Ful-R cells. (A) Parental MCF-7, tamoxifen-resistant (OHTR), and Fulvestrant-resistant (Ful- 
R) variants were treated with ethanol or ATRA (1 (J.M) for 72 hours and subjected to flow cytometry as indicated. Percentage of CD44+/CD24+ Ful-R 
cells changed from 46 ± 3 to 13 ± 0.5 upon ATRA treatment (p = 0.003). (B) Morphological changes in Ful-R cells upon long-term exposure to ATRA. 
Cells were passaged for one month with or without ATRA treatment and photographed. (C) Cell surface CD44 and CD24 expression status of long-term 
ATRA treated Ful-R cells. (D) The effects of ATRA on CD271 + CSCs. MCF-7, OHTR, and Ful-R cells were treated with vehicle or ATRA ( 1 uM) for 72 
hours and cell surface expression of CD271 was measured by flow cytometry. Representative isotype control (purple), untreated (green), and ATRA 
treated cell (pink) histograms from three experiments are shown. 
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ATRA-mediated reduction in CSC properties correlates with 
repression of select CSC-associated genes. Several reports have 
shown induction of CSC-like phenotype upon EMT and non-CSCs 
acquiring CSC properties upon overexpression of EMT-associated 
genes such as SNAI2/SLUG 203031 . Another recent study demon- 
strated a role of embryonic lineage commitment gene SOX2 in 
inducing CSCs and both SOX2 and SLUG are enriched in the 
recently described CD271-positive basal-like CSCs 632 . To identify 
potential targets of ATRA in cancer cells that have acquired CSC 
properties through EMT process, we performed EMT array analysis 
of untreated and ATRA treated Ful-R cells. These preliminary 
analyses indicated ATRA repressing AHNAK, CAV2, CDH2, 
EGFR FGFBP1, IGFBP4, JAG1, MMP9, Notch 1, SERPINE1, 
SNAI2/SLUG, and TGF)3l (data not shown). None of these genes 
was repressed by ATRA in partially responsive MDA-MB-231 cells 
(data not shown). EGFR is of specific interest as the tumor-specific 
constitutively active variant III is associated with CD44+/CD24— 
subpopulation of breast CSCs 33 . Consistent with this possibility, 
EGFR showed cell type specific differences in expression as well as 
protein species. MCF-7, OHTR, and Ful-R cells expressed EGFR of 
different mobility with Ful-R cells expressing a low molecular weight 
EGFR (Figure 3A). EGFR protein levels were higher in OHTR and 
Ful-R cells compared to parental cells (Figure 3A). Irrespective of 
levels and isoforms, ATRA reduced the levels of EGFR in all three- 
cell types (Figure 3A). 

The expression levels of EGFR, SERPINE1, SOX2, and SLUG in 
multiple cell types were examined by qRT-PCR. Basal expression of 
these genes was higher in Ful-R cells compared to MCF-7 or OHTR 
cells although statistical significance was achieved only with SOX2 
and SLUG due to experimental variability (Figure 3B). OHTR and 
Ful-R cells expressed higher levels of SOX2 compared to MCF-7 (p < 
0.004, Figure 3B, top left). ATRA reduced EGFR, SERPINE1, SOX2, 
and SLUG in Ful-R cells but only SOX2 in OHTR cells (Figure 3B, 
bottom center and right). Because of very low basal expression, the 
effect of ATRA on SOX2 expression in MCF-7 cells could not be 
reliably measured. 

To further extend the correlation between ATRA-mediated reduc- 
tion in CSC properties and loss of SOX2 expression, we measured the 
effect of ATRA on SOX2 expression in MDA-MB-231 and MDA- 
MB-436 cells. ATRA repressed SOX2 expression in only MDA-MB- 
436 cells (Figure 3C). It appears that MDA-MB-231 cells are not 
dependent on SOX2 to maintain CSC phenotype because the basal 
expression of this gene was 2.5 fold lower compared to MDA-MB- 
436 cells (Figure 3D). Thus, ATRA-mediated effects on CSCs con- 
sistently correlated with repression of SOX2 and EGFR. 

We next examined whether EGFR, SERPINE1, SLUG, and SOX2, 
which are repressed by ATRA in a cell type dependent manner, 
constitute a prognostic signature in breast cancer using the publicly 
available database 34 . Elevated combined expression levels of these 
four genes were associated poor recurrence-free and distant meta- 
stasis-free survival outcome in patients with ER-negative breast can- 
cer (Figure 4A-B). Analysis of different subtypes of breast cancer 
showed prognostic relevance of these genes in basal subtype but not 
luminal A, luminal B or HER2 subtypes (Figure 4C-D and data not 
shown). SOX2 plus EGFR had similar prognostic impact on ER- 
negative but not specific subtypes of breast cancer (data not shown). 
Collectively, these results illustrate that CSC-associated genes that 
are targets of ATRA have prognostic implications in breast cancer. 

ATRA induces the expression of SOX2 antagonist CDX2. Agonistic 
and antagonistic interactions between cell type specific transcription 
regulators and SOX2 are required for cell fate determination during 
development and to maintain homeostasis in adult tissues 35 . We first 
surveyed these SOX2-associated transcription factors (Oct4, Tbx6, 
Pax6, MITF, Nkx2.1, and Cdx2) for prognostic relevance in breast 
cancer using public database 34 , their relationship to ATRA signaling, 



and direct regulation by SOX2. This preliminary screening suggested 
CDX2 as a SOX2 antagonist that is likely to be under the control of 
ATRA. SOX2 has previously been shown to repress CDX2, whereas 
CDX2 represses SOX2 3637 . ATRA increased CDX2 expression in Ful- 
R cells (Figure 5A). ATRA had no effect on CDX2 expression in 
MDA-MB-231 and MDA-MB-436 cells (data not shown). These 
results show ATRA-mediated loss of CSC phenotype in certain cell 
types is associated with altered SOX2:CDX2 ratio. 

SOX2/CDX2 ratio has prognostic implication in breast cancer. 

CDX2 has not been studied extensively in breast cancer. However, 
previous studies have shown a good prognostic value of CDX2 in 
gastric, ovarian, and pancreatic cancer 3840 . Oncomine analysis of 
TCGA data 41 revealed similar CDX2 expression levels between 
adjacent normal breast and ER-positive breast cancer but reduced 
expression in ER-negative breast cancer (Figure 5B). In contrast, 
SOX2 expression was higher in both ER-positive and ER-negative 
breast cancer compared to normal adjacent tissue (Figure 5B). The 
concept map analysis using Oncomine revealed that CDX2 as one of 
the top 1% of the underexpressed gene in all but one breast cancer 
datasets. Similar analysis of SOX2 did not show any specific pattern 
(data not shown). Since SOX2 and CDX2 have been shown to repress 
each other's expression, we next asked whether ratio between two 
genes has an impact on breast cancer outcome. NKI dataset is the 
most widely used dataset of this type of studies 42 . Higher SOX2/ 
CDX2 ratio was associated with poor recurrence-free survival 
(Figure 5C). In addition, elevated SOX2/CDX2 ratio correlated 
with worst outcome in patients who did not receive hormonal 
therapy (mostly ER-negative) or received chemotherapy (Figure D, 
E). Similar results were obtained using a different dataset (data not 
shown) 43 . These results clearly demonstrate relevance of SOX2 
and CDX2 ratio in breast cancer outcome. Our results show the 
ability of ATRA to reverse this ratio in a cell type-dependent 
manner. However, our attempts to knockdown SOX2 and CDX2 
expression to significant levels in Ful-R cells using siRNA were not 
successful. Therefore, functional role of these genes in mediating 
ATRA effects on mammosphere needs to be verified through 
further experimentation. 

PD0325901 (Selumetinib), a MEK inhibitor, sensitizes MDA-MB- 
231 cells to ATRA. Despite displaying CD44 + /CD24- features and 
characteristics of MSL subtype of TNBC, which scored negatively 
with ATRA in CMAP, MDA-MB-231 cells were resistant to ATRA. 
To determine the potential mechanisms of ATRA resistance of these 
cells, we analyzed COSMIC (Catalogue of Somatic Mutations in 
Cancer) database of Sanger Institute for cancer-associated mutations 
that confer resistance to ATRA. Among nine cancer-associated 
mutations that determine sensitivity to ATRA (Notch 1, BCR_ABL, 
KIT, FLT3, APC, TET2, K-ras, ALK, MLL_AFF1), KRAS, APC, and 
KIT mutations are associated with resistance to ATRA (Figure 6A, 
only K-ras is shown). MDA-MB-231 cells carry K-ras mutation 
(G13D) raising the possibility that this mutation contributes to 
ATRA resistance 44 . We used the same database to determine drugs 
that are effective against cell lines with K-ras mutation. PD0325901, a 
MEK inhibitor, and AZ628, a C-RAF inhibitor, were effective against 
cell lines with K-ras mutation (Figure 6B). Since PD0325901 
(Selumetinib) is already in clinic, we examined its ability to inhibit 
proliferation of MDA-MB-231 in 2D culture and to form mammos- 
pheres with or without ATRA treatment. PD0325901 had minimum 
effect on MDA-MB-231 proliferation in 2D cultures with or without 
ATRA treatment (Figure 6C) but substantially reduced mammos- 
phere formation when combined with ATRA (Figure 6D). These 
results suggest that genomic make up of cancer cells can be utilized 
to develop combination therapies involving pathway-specific drugs 
and ATRA. 
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Discussion 

Despite controversies surrounding the cancer stem cell hypothesis, 
patients with tumors that are enriched for gene expression signatures 
of CSCs encounter rapid disease progression and poor outcome 12 . 
EMT is one of the mechanisms by which cancer cells acquire CSC 
properties 2030 . More often, residual tumors after therapy express 
higher levels of EMT-inducing genes, display elevated mammo- 
sphere forming ability, and have elevated TGF|3-signature score 11,45 . 
At present, there are no drugs in clinical use that target CSCs. 
Considering this need, we adapted CMAP approach. ATRA iden- 
tified in this screen is more effective in reducing CSC phenotype and 
CSC-associated gene expression in cancer cells that have acquired 
CSC phenotype during the course of developing resistance to a tar- 
geted therapy. Ful-R cells, a fulvestrant resistant cell line derived 
from a single cell clone of MCF-7, were more sensitive to ATRA than 
MDA-MB-436 and MDA-MB-231 cells, which most likely represent 
intrinsic CSCs 31 . 



ATRA, which came into solid tumor oncology clinic after extens- 
ive preclinical studies, performed very poorly in clinical trials. A 
recent clinical trial of ATRA in combination with taxol in patients 
with recurrent or metastatic disease revealed overall clinical benefit 
of 76.4% with a relatively high rates of stable disease 46 . However, 
these studies were not biomarker driven. Our results demonstrating 
ATRA being effective against tumor cells that have acquired CSC 
phenotype suggests its utility in a specific group of patients who has 
failed hormonal therapy. Recent neoadjuvant trials with hormonal 
therapy have shown enrichment of cancer cells with CSC properties 
in residual tumors 11 . Whether combining ATRA with anti-hormonal 
therapy will be an effective strategy to eliminate CSCs remains to be 
determined. 

ATRA reduced mammosphere-forming ability of cell lines that 
expressed higher levels of SOX2 suggesting that only the cancer cells 
that are dependent on SOX2 for self-renewal are responsive to 
ATRA. ATRA also reduced the levels of EGFR, SERPINE1, and 
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SLUG in a cell type-dependent manner. Both SERPINE1 and SLUG 
overexpression is linked to higher tumor grade and poor outcome in 
breast cancer patients 12 . Interestingly, in hepatocellular carcinoma, 
SOX2 induces the expression of SLUG 47 . SLUG is the main sup- 
pressor of human breast progenitor cell lineage commitment and 
differentiation and is aberrantly expressed in BRCA1 mutant tissue 48 . 
EGFR and SOX2 constitute a feedback loop that positively regulates 
self-renewal of neural stem cells 49 . The ability of ATRA to inhibit 
EGFR and SOX2 in all responsive cell lines, and SERPINE1 and 
SLUG in few cell types and repression of these genes correlating with 
diminished CSC phenotype raise the possibility of using these genes 
as biomarkers to distinguish ATRA responders from non-respon- 
ders. CDX2, an antagonist of SOX2, can also be included as a 



biomarker as it is ATRA-inducible in responsive cell line. 
Consistent with this possibility, our analysis of gene expression array 
datasets revealed the prognostic relevance of these ATRA targets 
(Figure 4 and 5). 

Recent studies in pancreatic cancer have identified biomarkers of 
ATRA response 50 . Two intracellular ATRA binding proteins, FABP5 
and CRABPII, determine whether ATRA activates RAR:RXR signal- 
ing or RXR:PPARp/8 signaling 51 ' 52 . FABP5 directs ATRA to RXR/ 
PPARR/5 signaling, whereas CRABPII directs ATRA towards 
RAR:RXR signaling. Therefore, pancreatic cancer with lower levels 
of FABP5 retain sensitivity to ATRA compared to cells with high 
level of this intracellular retinoid binding protein 50 . An inverse cor- 
relation was observed with CRABPII. If FABP5 and CRABPII 
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function similarly in breast tissues, CDX2, EGFR, SERPNIE1, SLUG, 
SOX2, FABP5, and CRABPII may serve as biomarkers of ATRA 
response. Additional preclinical studies are required to test this 
possibility. 

MDA-MB-231 cells, which are CD44+/CD24- but appear 
SOX2 -independent, were least sensitive to ATRA. However, this cell 
line has K-ras mutation, which is known to confer resistance to 
ATRA 53 . Inhibition of MEK, the downstream target of K-ras, 
restored sensitivity to ATRA in mammosphere assay. Thus, K-ras 
mutation and MEK activation status are additional biomarkers of 
ATRA sensitivity. 

ATRA works through nuclear receptors, which are expressed at 
variable levels in different subtypes of breast cancer including can- 
cers enriched for CSCs. Cell lines used in this study express all three 
RARs and RXRs based on the analysis of mRNA levels available 
publicly 54 . Similar to our study, Ginestier et al showed the ability of 
ATRA to inhibit mammosphere formation by ALDH1 -positive 
CSCs 55 . Papi et al recently demonstrated the effect of ATRA and 
the RXR-specific ligand 6-OH-ll-O-hydroxyphenanthrene in redu- 
cing CSC phenotype of breast cancer cells by targeting NF-kB path- 
way 56 . Our study clearly shows not all cancer cell lines with CSC 
phenotype are responsive to ATRA suggesting the existence addi- 
tional determinants of ATRA action on CSCs. As ATRA repressed 
SOX2 in only responsive cell lines, understanding how ATRA 
represses SOX2 may provide insights into mechanisms and barriers 
involved in ATRA action. Collectively, our results should stimulate 
interest in developing ATRA based therapy for specific subtypes of 
breast cancer, which additionally considers biomarker driven patient 
selection and cancer genome-based combination therapies. 

Methods 

Cell lines. A single cell derived MCF-7 clone expressing ERE-Luciferase (parental), 4- 
hydoxy tamoxifen-resistant variant (OHTR) and fulvestrant- resistant variant (Ful-R) 
cell lines have been described previously and are generous gift from Dr. Ken 
Nephew 26 . MDA-MB-231, TMD-231, and MDA-MB-436 cells have been described 
previously 31 . Most of the studies with MDA-MB-231 cell line were done using a 
variant selected from a mammary fat pad tumor in nude mice (TMD-231). All cell 
lines were maintained in phenol red-free MEM with five percent dextran- charcoal 
treated fetal calf serum for at least four days prior to starting experiments and all 
experiments except mammosphere assays were done in this medium. 

Mammosphere assay. 5,000 to 20,000 cells were plated on ultra-low adherent six well 
or 100 mm plates in MammoCult media from Stemcell Technologies (Vancouver, 
BC, Canada) as per instruction from manufacturers. Although few clumps with 10- 
20 cells were observed a day or two after plating, these clumps disappeared and 
mammospheres were detected by seven days. Mammospheres were visualized and 
photographed. For secondary and tertiary cultures, mammospheres were collected by 
centrifugation, washed, trypsinized, and equal number of cells were replated in 
mammosphere media. At the end of the experiment, mammospheres were filtered 
through 40-microfilter. Filters were stained with Wright-Giemsa stain (Fisher 
Diagnostics, VA, USA) and mammospheres were counted. 

RNA extraction and quantitative reverse transcription polymerase chain reaction 
(qRT-PCR). RNA was isolated using the RNAeasy kit from Qiagen (Valencia, CA, 
USA). Single strand cDNA was synthesized using single strand synthesis kit (Bio-Rad 
Laboratories, Hercules, CA). qPCR was performed in duplicate measurements using 
syber-green PCR mix (Applied Biosystems) and specific primers on the qPCR 
instrument (Applied Biosystems). [3-actin served as a normalization control. 
Sequences of primers used are provided in supplementary Table 1. EMT array 
analysis was done using PAHS-090 array from SA Biosciences (Valencia, CA, USA) as 
per instructions from the manufacturer. 

Analysis of publicly available databases. Oncomine database was used to determine 
the expression pattern of SOX2 and CDX2 in TCGA dataset 41 , whereas in-house 
database was used to determine the prognostic importance of SOX2/CDX2 ratio in 
NKI dataset 42 . Gene array datasets used for connectivity map are described in the text. 
The prognostic value of combined EGFR, SLUG/SNAI2, SERPINE1, and SOX2 was 
determined using KMplot 34 . 

Cell extracts and western bot analysis. Whole cell lysates were prepared in RIPA 
buffer and western blotting was preformed as described previously 20 . Antibody 
against EGFR was purchased from Santa Cruz biotechnology (Santa Cruz, CA, USA). 



Cell cycle, apoptosis, and flow cytometry. Flow cytometry was performed as 
described previously 31 . CD44-FITC and CD24-PE antibodies were purchased from 
BD Biosciences (San Jose, CA, USA), whereas CD271-APC antibody was from 
Biolegend (San Diego, CA, USA). Cell cycle analysis was performed as described 
previously 57 . Apoptosis assay of mammospheres was performed using Annexin V 
apoptosis assay kit as recommended by manufacturers (Invitrogen, Carlsbad, CA, 
USA). Representative data from three or more experiments are presented. 

Statistical analysis. qRT-PCR, flow cytometry and proliferation assay results were 
subjected unpaired f test using GraphPad Prism software (Graphpad.com). The two- 
tailed p values of <0.05 were considered statistically significant. 
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